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Although there is increasing evidence that vims-specific cytotoxic-T-lymphocyte (CTL) responses play an 
important role in the control of human immunodeficiency virus (HIV) replication in vivo, only scarce CTL data 
are available for the ethnic populations currently most affected by the epidemic. In this study, we examined the 
CDS^-T-cell responses in African-American, Caucasian, Hispanic, and Caribbean populations in which clade 
B virus dominates and analyzed the potential factors influencing immune recognition. Total HIV-specific 
CD8'^-T-cell responses were determined by enzyme-linked immunospot assays in 150 HIV-infected individuals 
by using a clade B consensus sequence peptide set spanning all HIV proteins. A total of 88% of the 410 tested 
peptides were recognized, and Nef- and Gag-specific responses dominated the total response for each ethnidty 
in terms of both breadth and magnitude. Three dominantly targeted regions within these proteins that were 
recognized by >90% of individuals in each ethnicity were identified. Overall, the total breadth and magnitude 
of CD8"^-T-cell responses correlated with individuals' CD4 counts but not with viral loads. The frequency of 
recognition for each peptide was highly correlated with the relative conservation of the peptide sequence, the 
presence of predicted immunoproteasomal cleavage sites within the C-terminal half of the peptide, and a 
reduced frequency of amino acids that impair binding of optimal epitopes to the restricting class I molecules. 
The present study thus identifies factors that contribute to the immunogenicity of these highly targeted and 
relatively conserved sequences in HIV that may represent promising vaccine candidates for ethnically heter- 
ogeneous populations. 



Cytotoxic T lymphocytes (CTL) recognizing HLA class I-re- 
stricted viral epitopes are considered an important arm of the 
antiviral immune defense in human immunodeficiency virus 
(HIV) infection (11). This is supported by CTL depletion and 
iminune-escape studies, as well as vaccination experiments in 
the simian immunodeficiency virus model, which suggest that 
robust CTL activity can potentially control infection (5, 6, 49). 
In humans, the temporal association of CTL responses with 
reduction in the viral load in early HIV infection and the 
association of HIV disease progression with certain HLA class 
I alleles and supertypes reinforce the findings of animal studies 
(33, 37, 43, 51, 52; E. Trachtenberg, B. Korber, C. Sollars, E. 
Hayes, R. Funkhouser, T. Kepler, P. Hraber, M. Hsu, H. Er- 
lich, and S. Wolinsky, abstract from the ASHI 28th Annu. 
Meet., Hum. Immunol. 63:832, 2002). However, even broad 
CJTL activity may not be able to avert subsequent superinfec- 
tion (3), and early and late escape from CTL responses can be 
associated v^rith increases in the viral load (27, 35). Such data 
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suggest that CTL responses to relatively conserved regions will 
be critical for the development of an effective vaccine, even 
though the inclusion of variable regions may also provide some 
benefit (44). 

The identification of highly immunogenic and conserved re- 
gions is complicated by HIV sequence diversity and differences 
in host genetics among different ethnic groups (17). Even in 
geographically restricted regions, such as large urban areas, 
where one HIV clade dominates the epidemic, the ethnic dis- 
tribution and HLA allele variability may complicate the se- 
lection of a promising vaccine candidate, particularly when 
subunit- or epitope-based vaccines are being considered. In 
addition, many studies, especially those performed early in the 
HIV epidemic in the United States, have focused on HIV- 
infected individuals of Caucasian descent, leading to a paucity 
of CTL data for the ethnicities in which the HIV epidemic 
currently spreads fastest, including African, Asian, and Carib- 
bean populations (1, 7, 16, 23, 31, 41, 42, 46). 

The present study was designed to identify regions in HIV 
clade B protein sequences that are frequently targeted by HIV- 
infected individuals from different ethnicities and which may 
therefore be particularly relevant for clade B-based vaccine de- 
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TABLE 1. Demographics, viral loads, and CD4 counts of 150 study subjects 




Parameter 




Value* 






African-Americans 


Caucasians 


Hispanics 


West Indians 


n 

Gender (male/female) 
Treatment (on/off) 
Median viral load 
No, of RNA copies/ml 
Median CD4 count 
No. of cells/Ml 


59 
45/14 
. 43/16 
497 

(<50-400,000) 
329 
(7-1,280) 


26 
20/6 
15/11 
272 

(<50-106,000) 
322 
(28-945) 


44 
32/12 
24/19 
5,654 
(<50->750,0b0) 
283 
(11-1,476) 


21 
•^11/10 
6/15 
11,300 
(<50->750,000) 
223 
(^792) 



' Numbers in parentheses indicate ranges of values observed among enrolled individuals. 



sign and testing. Initially, the HLA allele distributions among 
HIV-infected and uninfected African-American, Hispanic, or 
West Indian individuals living in the Boston area and in HIV- 
infected individuals in Barbados were determined and com- 
pared to those in Caucasians by performing intermediate- to 
high-resolution HLA typing. One hundred fifty HIV-infected 
individuals were then comprehensively tested for total HIV- 
specific CTL activity using a clade B consensus sequence-based 
overiapping-peptide (OLP) set in enzyme-linked immunospot 
(ELISpot) and intracellular cytokine-staining analyses. The 
most immunogenic regions across all tested ethnicities were 
then determined, and factors that might contribute to the im- 
munodominance of those regions were investigated. 

MATERIALS AND METHODS 

Study subjects. A total of 27S HIV-positive and HI V-negative subjects were 
recruited for the HLA-typing study from four hospitals in the Boston area and at 
the Queen Elizabeth Hospital in Barbados. Of these, 150 HIV-positive individ- 
uals at all clinical stages with detectable HIV-specific cellular responses were 
included in the CD8*-T-ceU study; all persons were included regardless of CD4 
count, viral load, or treatment status; 15 HIV-negative subjects were included as 
controls. The studies were approved by the institutional review boards of all 
participating hospitals, and all subjects provided written informed consent before 
being recruited. The study subjects' characteristics are summarized in Table 1. 

Lymphocyte separation. Peripheral blood mononuclear cells (PBMCs) were 
separated from whole blood by density gradient centrifugation (Histopaque 
1077; Sigma, St. Louis, Mo.) within 24 h of venipuncture for individuals recruited 
in the Boston area and were used directly in the in vitro assays described below. 
Blood samples from individuals recruited in Barbados were processed after 
transport to Boston, leading to a delay of 24 to 48 h in lymphocyte separation for 
15 of the 21 samples. Of these, six were analyzed after being frozen, while for all 
other samples, the initial ELISpot screening and reconfirmation were carried out 
on freshly isolated PBMCs. Frozen samples were thawed, and the cells were 
rested in RPMI 1640 containing 10% heat-inactivated fetal calf serum (both 
from Sigma), 2 mM L-glutamine, 50 U of penicillin/ml, 50 jtg of streptomycin/ml, 
and 10 mM HEPES (all from Mediatech, Hemdon, Va.) at 3rC and 5% CX>2 for 
6 to 20 h prior to use in ELISpot assays. 

HLA typing. High- and intermediate-resolution HLA class I typing was per- 
formed at the HLA-typing laboratory of the Department of Transfusion Medi- 
cine, National Cancer Institute, National Institutes of Health, by sequence- 
specific primer PCR as previously described (12). DNA for typing was extracted 
using the Puregene DNA isolation kit for blood (Centra Systems, Minneapolis, 
Minn.) according to the manufacturer's instructions. 

HMA- The hcteroduplex mobility assay (HMA) was performed. as previously 
described (20; http://ubik.microbiol.washington.edu/HMA/index.html). Briefly, 
PCR-amplified fragments corresponding to the C2-V5 region of HIV type 1 
(HIV-1) Env from both the unknown specimen and a HIV-1 subtype reference 
were mixed» melted at 94°C, rcannealed on ice, and then displayed by electro- 
phoresis on a 5% neutral polyaciylamide gel, followed by staining with ethidium 
bromide. Heteroduplexes formed between the unknown sample and the most 
closely related reference sequences exhibit the fastest mobilities and thus identify 
the subtype of the unknown strain. 

Peptides. A set of 410 OLPs was used to screen for HIV-specific T-cell 
responses. The peptides spanned all HIV proteins and were based on the con- 



sensus B sequence of 2001 available at the HIV immunology database (http: 
//hiv-web.lanl.gov/contcnt/hiv-db/CONSENSUS/M_GROUP/Consensus.html), 
The peptides were generally 18-mers, varying from 15 to 20 amino acids in length 
and overlapping by 10 amino acids. The length variation was caused by the 
exclusion of amino acids that are rarely found in C-terminal anchor motife of 
HLA class I molecules from the C terminus (15), A previously described 
algorithm was applied for peptide design (http://hiv-web.lanl.gov/content/hiv-db 
/PEPTGEN/2001.html). This approach has recently been shown to provide a 
higher rate of peptide recognition than using an unadapted set of 18-meis (21). 
Peptides were synthesized at the peptide synthesis facUity of the Massachusetts 
General Hospital using 9-fluorenylmethyloxycaibonyl chemistiy or were ob- 
tained from Research Genetics, Huntsville, Ala. The peptides were tested in a 
peptide pool matrix to facilitate comprehensive screening as described previously 
(1). Reconfinnations of all positive wells in the matrix screen were done the 
following day on a single-peptide basis. The numbering of peptides was as 
foUows: Gag-pl7 OLP 1 to OLP 17, Gag-p24 OLP 18 to OLP 48, Gag-pl5 OLP 
49 to OLP 66, Nef OLP 67 to OLP 93, Rev OLP 94 to OLP 108, Tat OLP 111 
to OLP 122, Vpu OLP 123 to OLP 131, Pol-protease OLP 145 to OLP 164, 
Pol-reverse transcriptase (RT) OLP 165 to OLP 240, Pol-integrase OLP 241 to 
OLP 277, Vpr OLP 278 to OLP 288, Env-gpl20 OLP 289 to OLP 365. Env-gp41 
OLP 366 to OLP 401, and Vif OLP 402 to OLP 425, 

ELISpot assays. PBMCs were plated in 96-well polyvinylidene difluoride 
plates (Millipore, Bedford, Mass.) which had been preooated overnight with 0.5 
\Lg of the anti-gamma interferon (IFN-7) monoclonal antibody 1-DlK (Mabtech, 
Stockholm, Swcden)/ml. Each peptide was added at a final concentration of 14 
p.g/ml (for both single peptides and pooled peptides). A total of 100,000 cells per 
well were added in 140 ^1 of RPMI 1640 containing 10% heat>inacthrated fetal 
calf serum, 2 mM L-glutamine, 50 U of penicilUn/inl, SO p.g of streptomydn/ml, 
and 10 mM HEPES. For negative controls, cells were incubated in medium 
alone. Phytohemagglutinin (Sigma) was added at a concentration of 1.8 p-g/inl to 
serve as a positive control. The cells were incubated overnight at 3TC with 5% 
CO2. The plates were developed by washing them six times with phosphate- 
buffered saline (no Ca or Mg; Mediatech), and OS |ig of btotinylated anti-IFN-7 
monoclonal antibody 7-B6-1 (Mabtech)/ml was added for 1 h at room temper- 
ature. The plates were again washed and incubated with a 1:2,000 dilution of 
streptavidin-coupled alkaline phosphatase (Mabtech) for 1 h at room temperia- 
ture in the dark. The plates were washed again, and IFN-7 production was 
detected as blue spots after a short incubation with nitroblue tetrazoUum and 
S-bromo-4-chloro-3-indolyl phosphate (Bio-Rad, Hercules, Calif.). The color 
reaction was stopped by washing the plates witii tap water. The spots were 
counted using the AID ELISpot Reader unit (Autoinmiun Diagnostika GmbH, 
Strasbourg, Germany), and the results were expressed as spot-forming cells 
(SFC) per million input cells. 

Thresholds for positive responses were determined as at least fwe spots (50 
SFOIO^ PBMC5) per well and as responses exceeding the mean of negative wells 
plus 3 standard deviations and three times the mean, whichever was higher. In all 
tested subjects except thiee individuals, the mean of six or more negative control 
wells was <3S SFC/10* PBMCs, with an overall average of negathre wells (in- 
cluding the high-background subjects) of 11 SFQIO^ PBMD. In addition to the 
above cutoff, responses were considered positwc only if tiicy were detected again 
in tfie subsequent confirmational EUSpot assay using single peptides. Two 
HIV* and all 15 of the HIV^ individuals tested in a blinded fashion did not show 
detectable responses in ELISpot assays. Responses to peptides were previously 
shown to be mediated by CDS* T cells in depletion experiments using Ro- 
sctteSep CD4 and CDS reagents (StemCell Technologies, Vancouver, Canada) 
according to the manufacturer's directions (1. 23). Additionally, CD8*-T-cell 
dependence testing was performed on a subset of study subjects using flow 



Vol. 78, 2004 



■HIV-l-SPfiCIFIC CTL responses in diverse ethnicities 2189 



c 



B 



S 

I 

3 




01 02 03 11 23 24 25 26 29 30 31 32 33 34 36 66 68 74 



0^ 



0.15 



0.1 



0.05 



i 




JL 



i 



t 



1 



1 



i 



,n . n .n 



07 08 14 15 18 27 35 37 38 39 40 41 42 44 45 47 49 50 51 52 53 57 58 78 81 82 





0.3 




0^5 






c 


0.2 






a* 






0.15 






2 


0.1 








0.05 




0 




01 02 03 04 05 06 07 08 12 14 15 16 17 
FIG. 1. HLA allele frequencies in different ethnicities. HLA class I allele distributions are shown for HLA-A (A), HLA-B (B), and HLA-Cw 
(C) alleles. Frequencies are shown for 73 African-Americans (open bars), 75 Caucasians (lightly shaded bars), 47 Hispanics (darkly shaded bars), 
and 80 West Indians (solid bars). The asterisks mark alleles that show significant differences among ethnicities (Fisher*s exact test; P < 0.001). 



cytometric intracellular cytokine staining as previously described (8, 28) with the 
monoclonal antibodies anti-CDS-allophycocyanin, CD4-R-phycoerythrin, and 
IFN--y-fluoresccin isothiocyanatc (all from Becton Dickinson, San Jose, Calif.). 

Breadth and magnitude of responses. The breadth of the responses was 
assessed both by considering eveiy recognized peptide as a separate response 



and by considering recognition of two adjacent peptides as a response to one 
**epitopic region" (1). To enumerate cpitopic regions, the following algorithm 
was employed. When two adjacent peptides were recognized, only the stronger 
of the two responses was included. In the case of three adjacent peptides eliciting 
a response, the weakest of all three peptides was deleted, and die responses were 
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TABLE 2. Distribution of CTL responses within and 
between proteins 



Protein 


No. of peptides" 


% Of subjects'' 


% Of peptides' 


pl7 


17 


60.0 


94.1 


p24 


31 


86.0 


100 


pl5 


1 Q 


jV.f 




Nef 


27 


91.3 


96.3 


Rev 


15 


32.7 


93.3 


Tat 


12 


28.7 


83.3 


Vpu 


9 


11.3 


77.8 


Pro 


20 


25.3 


65.0 


RT 


76 


111 


90.8 


Int 


37 


76.0 


86.5 


Vpr 


11 


37.3 


100 


gpl20 


67 


65.3 


83.6 


gp41 


46 


57.3 


78.3 


Vif 


24 


40.0 


100 


Total 


410 


100 


88.3 



' Number of overlapping peptides spanning the protein sequence. 

Percentage of subjects targeting each of the proteins. 
' Percentage of peptides recognized within each protein by at least one indi- 
vidual. 



counted as two epitopic regions. All cases of responses to four, six, or eight 
adjacent peptides were considered multiple pairs and counted as two, three, or 
four epitopic regions, respectively. All cases where five and seven adjacent 
peptides were targeted were considered a triplet and one or two pairs and 
counted as three and four epitopic regions, respectively. The data set contained 
1,638 responses to single peptides, 691 cases of responses to two adjacent pep- 
tides, 112 cases of responses to three adjacent peptides, 36 cases of responses to 
four adjacent peptides, 15 cases of responses to five adjacent peptides, 3 cases of 
responses to six adjacent peptides, 2 cases of responses to seven adjacent pep- 
tides, and 1 case of responses to eight adjacent peptides. 

The total magnitude of responses was determined as the sum of the responses 
to all peptides and epitopic regions. The average magnitude was obtained by 
dividing the total magnitude by the number of targeted peptides. 

Overall, the number of responses expressed as single peptides was highly 
significantly associated with the number of epitopic regions {P < 0.0001; r = 
0.9903; Spearman rank correlation). Statistical analysis of correlations between 
the breadth or magnitude of responses and the viral load, as well as CD4 counts, 
yielded mostly identical P values when both counting approaches were applied. 
Therefore, data are presented based on counting the actual number of peptides 
that were recognized. 

Entropy, frequency of "forbidden** amino acids, and cleavage scores. As a 
measure of the variability of each of the peptides, the average entropy scores for 
all 410 OLPs were determined. The average entropy score for each peptide was 
calculated from Shannon entropy for each residue of B clade protein alignments, 
using only complete proteins and only a single protein from multisequence sets 
derived from a single patient (36, 47). The fraction of forbidden residues was 
determined based on the number of G, P, E, D, Q, N, T, S, and C residues 
divided by the number of residues in the peptide. These amino acids rarely serve 
as the C-terminal anchors of optimally defined CTL epitopes (15). Proteasomal- 
cleavage prediction scores were determined based on a cleavage prediction 
algorithm (34) that has been shown to be highly predictive of C termini in HIV-1 
epitopes (55). A cutoff score of ^0.7 (based on comparisons to known epitopes 
in the database) (55) was used, and the fraction of amino acid residues that had 
a score above the threshold was determined. Nine amino acids on either side of 
the residue whose cleavage potential was being estimated were used to provide 
the context for the neuraJ-net prediction (55). Only cleavage scores in amino 
acids in the C-terminal half of peptides were considered, starting at the eighth 
position in each peptide, thus avoiding high cleavage scores near the N terminus 
of a peptide, as high cleavage scores embedded within an epitope might have a 
counterbalancing effect of diminishing the number of intact epitopes processed 
for presentation. Stepwise multiple-regression analysis to model the contrSiutions of 
these variables to predicting the frequency of recognition for each peptide was 
carried out using Splus version 6.1.2 (Insightful Corp.). 

Statistical analysis. Statistical analysis was done using GraphPad Prism 
Macintosh version 3.0 and Splus version 6.1.2. The results are generally pre- 
sented as median values with ranges. Tests included the nonparametric Mann- 



Whitney test (two-tailed) for comparison between two ethnicities and treatment 
groups, the Kruskal-Wallis test for comparison among all four ethnicities, Spear- 
man's nonparametric rank test (two-tailed) for correlations, and Fisher's exact 
test for the analysis of the HLA allele distributions. Correlations were considered 
statistically significant when P was <0.05, unless multiple tests were performed, 
in which case Bonferroni correction was applied to correct for multiple subgroup 
analyses. Viral-load values below the limit of detection of 50 RNA copies/ml 
were assigned a value of 49 for statistical analyses; for values with a detection 
limit of 400 copies/ml, a value of 399 was used. Viral loads of >750,000 copies/ml 
were assigned a value of 750,001. 

All data have been made available to the public and can be accessed at 
http:/Avww.hhr.lanl.gov/content/immunology/hlatem/indexJitml. 

RESULTS 

Variable HLA allele frequencies in different ethnicities. The 

presentation of antigenic peptides to CTL is largely dependent 
on the abilities of processed peptides to bind to a restricting 
HLA class I molecule. HLA allele frequencies vary among 
diverse ethnic populations, and this variance may have a direct 
influence on the distribution, breadth, and strength of virus- 
specific CTL responses. To address these issues, the MHC 
class I allele distribution was determined for a total of 275 
HIV-infected and uninfected subjects from four different eth- 
nicities: 73 African-Americans, 75 Caucasians, 47 Hispanics, 
and 80 subjects from the West Indies (Barbados and Haiti). 
HLA-All and HLA-A25 distributions differed significantly 
among ethnicities after correcting for multiple subgroup tests 
(P = 0.0001 and P = 0.001, respectively) (Fig. 1). For seven 
additional HLA alleles (A03, A29, B27, B40, B57, CwOl, and 
Cw08), a trend toward different distributions among ethnicities 
was found (P < 0.05). 

Broad distribution of HIV*specific CD8'*'-T-ceU responses 
across all HTV proteins. Previous studies have reported com- 
prehensive mapping of CD8"^-T-cell responses (1, 7, 41), but 
the cohort sizes have been limited and responses have not been 
linked to multiple ethnicities. To investigate which regions of 
the HIV protein sequences were targeted by the cellular im- 
mune response, 150 subjects of African-American, Caucasian, 
Hispanic, or West Indian descent were studied comprehen- 
sively for HIV-specific CD8*-T-cell activity. To this end, 410 
OLPs based on a recent clade B consensus sequence (http://hiv 
-web,lanl.gov/content/hiv-db/PEPTGEN/2001.htnil) were used 
in a matrix-based IFN-7 ELISpot assay as described previously 
(1). 

Comprehensive screening for CD8'*"-T-cell responses iden- 
tified an unprecedented breadth in the distribution of recog- 
nized peptides. Overall, the studied subjects responded to a 
median of 19 peptides, ranging from 1 to 67 recognized pep- 
tides. Gag, Nef, and integrase were the most frequently tar- 
geted proteins (Table 2). Overall, 88% (362 of 410) of all 
peptides spread over all proteins served as targets for CD8"*' T 
cells. The highest density of targeted peptides was seen for Gag 
p24, Vpr, and Vif, for which 100% of peptides were recognized 
by at least one person. The least broadly targeted protein was 
protease, with 65% of peptides spanning this region being 
targeted (Table 2). 

Identification of several highly targeted regions across di- 
verse ethnicities. Viral-sequence variability is considered a ma- 
jor roadblock in HIV vaccine development. However, vaccine 
design may also need to consider differences in HLA allele 
distribution among different ethnic populations, which may 
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FIG. 2. CTL responses in four ethnic groups are distributed over the entire HIV genome and cluster in immunodominant regions. The total 
HIV-specific CD8*-T-cell activities for 59 African-Americans, 26 Caucasians, 44 Hispanics, and 21 West Indians were tested using an OLP set of 
410 peptides. The frequencies of recognition for each single peptide in the four ethnicities are shown. The horizontal box at the bottom mdicates 
the HIV proteins spanned by the OLPs. 
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TABLE 3. Peptides differentially targeted across ethnic groups 



OLP 


Protein 


Sequence 


Overall 
frequency of 
recognition 


HLA'* 


Frequency of recognition per ethnicity* 

AA C H 


WI 


P value 


6 


pl7 


ASRELERFAVNPGLL 


13.3 


All 


6.8 


26.9 


20.5 


0.0 


0.0068 


17 


pl7/p24 


AADTGNSSQVSQNYPIV 


2.7 


All, A25 


0.0 


15.4 


0.0 


0.0 


0.001 


35 


p24 


WMTNNPPIPVGEIYKRWI 


16.0 


BOS, B53 


25.4 


23.1 


2.3 


9.5 


0.0035 


36 


p24 


PVGEIYKRWIILGLNKIV 


22.0 


BOS, B27 


30.5 


42.3 


9.1 


4.8 


0.001 


194 


RT 


KIEELRQHLLRWGFTTPDK 


13.3 


B44 


5.1 


23.1 


25.0 


0.0 


0.0018 


207 


RT 


ELAENREILKEPVHGVYY 


11.3 


A2 


5.1 


30.8 


13.6 


0.0 


0,0028 



" HLA allele most commonly present among individuals with responses to OLP. 
AA, African-American; C, Caucasian; H, Hispanic; WI, West Indian. 



lead to the preferential targeting of some peptides in one 
ethnicity but not another (26). To address this point and to 
identify immunodominant regions that are equally or differen- 
tially targeted among different ethnicities, the distribution of 
responses was analyzed based on the ethnic descent of the 
tested subjects (Fig. 2). This analysis revealed a conserved 
pattern of responses for African- Americans, Caucasians, His- 
panics, and West Indians, with all proteins targeted by all 
ethnicities and with particularly strong responses to HIV Gag 
and Nef. In all four ethnic cohorts, Nef had the most frequent 
responses, which clustered within two peaks around peptides 
76 to 79 and 81 to 85. 

Despite these strong similarities in the overall distribution of 
responses among the various ethnicities, six peptides in Gag- 
pl7, Gag-p24, and RT were identified that were differentially 
targeted among groups {P < 0.01) (Table 3). For each of these 
peptides, an association between the frequency of recognition 
in a given ethnic group and a potentially restricting HLA allele 
could be determined (Table 3). 

Identification of immunodominant regions for vaccine de- 
sign. While ethnicity-specific targeting of some peptides may 
occur, many peptides were frequently targeted by a substantial 
proportion of individuals across all ethnicities. 

Peptides that were recognized by >15% of the total popu- 
lation are listed in Table 4. Nef and Gag stand out as contain- 
ing the most frequently targeted peptides, particularly peptide 
Nef-76, which was recognized most frequently in African- 
Americans and West Indians and was also the second and 
fourth most frequently recognized peptide in Hispanics and 
Caucasians, respectively. Importantly, some of the 33 peptides 
shown in Table 4 also scored high when their intraindividual 
immunodominances were analyzed (data not shown). For in- 
stance, the Nef-76 epitope was recognized by almost 50% of 
the entire cohort, and it also had the strongest response for 
15% of the individuals who targeted it. 

In order to achieve population coverage regardless of HLA 
type, we combined heavily targeted peptides in immunodom- 
inant regions in an attempt to identify specific regions that are 
simultaneously targeted by a majority of individuals in each 
ethnicity. Table 5 indicates that as few as three such regions, 
two consecutive stretches in Nef and one shorter region in 
Gag-p24, are sufficient to achieve cumulative recognition by 
>90% of individuals in each ethnicity. 

Although these three regions were sufficient to elicit re- 
sponses in the vast majority of our study cohort, an effective 
vaccine may need to induce broader responses to overcome 
sequence heterogeneity and CTL escape (3). We therefore 



extended our analyses to a total of 10 regions, each recognized 
by at least 20% of individuals and elongated by nine amino 
acids at the C terminus to facilitate proteasomal processing 
(34, 55) and which therefore may represent potential vaccine 
candidates for the vaccination of heterogeneous ethnic popu- 
lations in regions where clade B is endemic (Table 6). 

Treatment success reduces the breadth and magnitude of 
responses. The notion that treated subjects show narrower and 
weaker CD8'*'-T-cell responses (4, 29, 30, 32) was not con- 
firmed in this cohort when the treated group was analyzed as a 



TABLE 4. Frequently targeted and immunodominant regions 
in different ethnicities 



OLP 


Protein 


3 


pl7 


5 


pl7 


11 


pl7 


12 


pl7 


23 


p24 


24 


p24 


25 


p24 


29 


p24 


35 


p24 


36 


p24 


40 


p24 


41 


p24 


42 


p24 


59 


pl5 


76* 


Nef 


77 


Nef 


78 


Nef 


79 


Nef 


81 


Nef 


82 


Nef 


83 


Nef 


84 


Nef 


85 


Nef 


91 


Nef 


115 


Tat 


216 


RT. 


223 


RT 


224 


RT 


275 


RT 


281 


Vpr 


294 


gpl20 


314 


gpl20 


401 


gp41 



Sequence 



Total 



% Recognition' 



AA 



H WI 



EKIRLRPGGKKKYKLKHI 

KHIVWASRELERFAV 

TGSEELRSLYNTVATLY 

SLYNTVATLYCVHQRIEV 

AFSPEVIPMFSALSEGA 

PMFSALSEGATPQDLNTM 

GATPQDLNTMLNTVGGH 

AAEWDRLHPVHAGPIA 

WMTNNPPIPVGEIYKRWI 

PVGEIYKRWIILGLNKIV 

GPKEPFRDYVDRFYKTLR 

YVDRFYKTLRAEQASQEV 

LRAEQASQEVKNWMTETL 

RQANFLGKIWPSHKGR 

EVGFPVRPQVPLRPMTYK 

QVPLRPMTYKAAVDLSHF 

YKAAVDLSHFLKEKGGL 

SHFLKEKGGLEGLIYSQK 

QKRQDILDLWVYHTQGYF 

LWVYHTQGYFPDWQNY 

QGYFPDWQNYTPGPGIRY 

NYTPGPGIRYPLTFGWCF 

RYPLTFGWCFKLVPV 

PEKEVLVWKFDSRLAFHH 

FHCQVCFTTKGLGISYGR 

QKIATESIVIWGKTPKFK 

QLEKEPIVGAETFYVDGA 

GAETFYVDGAANRETKL 

KWPRRKAKIIRDYGKQM 

ELKNEAVRHFPRIWLHSL 

TVYYGVPVWKEATTTLF 

YRLISCNTSVITQACPKV 

HIPRRIRQGLERALL 



16.7 
16.7 
22.0 
25.3 
15.3 
16.0 
20.0 
20.0 



22.0 15.4 

23.7 19.2 

23.7 23.1 

30.5 26.9 
16.9 11.5 
203 15.4 

18.6 11.5 
18.6 23.1 

16.0 25.4 23.1 

22.0 30.5 423 

18.6 11.5 

22.0 30.8 

20,3 46.2 

18.6 30.8 

52.5 50.0 

40.7 38.5 
33.9 30.8 

20.3 30.8 

28.8 26.9 

25.4 38.5 
23.7 38.5 
42.4 38.5 

35.6 26.9 
22.0 lis 

16.9 15.4 
22.0 15.4 
2Z0 11.5 
203 
20.3 
32.2 
25.4 



15.9 
9.1 
22.7 
22.7 
18.2 
13.6 



4.8 
4.8 
4,8 
9.5 
9.5 
4.8 



18.2 38.1 
25.0 9.5 



23 
9.1 



9.5 
4.8 



16.7 
253 
22.7 
233 
44.7 
34.0 
30.0 
18.0 
26.0 
30.0 
30.7 
37.3 
32.0 
153 
16.7 
16.0 
153 
18.0 
153 
253 
22.7 
153 
16.7 



15.9 143 
31.8 143 



18.2 
31.8 



9,5 
9.5 



11.5 
7.7 
19.2 
19.2 



11.9 23.1 
153 23.1 



34.1 38.1 
29.5 19.0 

31.8 143 

15.9 0.0 
25.0 19.0 

31.8 28.6 

40.9 19.0 
31.8 333 

31.8 333 

18.2 9.5 
203 9.5 

9.1 143 

4.5 23.8 

18.2 19.0 

9.1 19.0 

20.5 23.8 

18.2 28.6 

18.2 9.5 

15.9 143 



" Values arc given as percent recognition of peptide by overall (total) and 
ethnic cohorts. AA, Aftican-Amcrican; C, Caucasian; H, Hispanic; WI, West 
Indian. 

* Peptide 76 is the most frequently targeted peptide across ethnicities. 
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TABLE 5. Three frequently targeted regions are recognized by >90% of subjects across ethnic groups 









% Recognition* 


Region 


Sequence 


Length* 


Total AA C H WI 


I 


EVGFPVRPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIYSQK 


41 


92.7 91.5 100 90.9 90.5 


II 


QKRQDILDLWVYHTQGYFPDWQNYTPGPGIRYPLTFGWCFKLVPV 


45 


III 


PRTLNAWVKWEEKAFSPEVIPMFSALSEGA 


31 





' Number of amino acids. 

* Cumulatwe percentage of subjects recognizing regions I to III. AA. African-American; C. Caucasian; H, Hispanic; WI, West Indian. 



whole (Fig. 3). Only when the treated subjects were stratified 
into a treatment-suppressed group (including subjects with a 
plasma viral load of ^200 RNA copies/ml) and those who were 
treated but had viral loads of >200 RNA copies/ml did the 
previously described decline in the breadth and magnitude of 
CD8'^-T-cell responses become apparent. While no difference 
between the breadths of responses in untreated and treated 
but unsuppressed subjects was observed, suppressive treatment 
led to a significant reduction in the number of targeted pep- 
tides (P = 0.0007). Similarly, the magnitude of the total CDS""- 
T-cell response was also significantly (P < 0.0001) reduced in 
the latter group compared to untreated subjects and individu- 
als for whom treatment failed. 

CD4 count, but not viral load, correlates with breadth of 
CD8"*'-T-cell responses. The observation that T-cell activity is 
diminished during successful therapy with very low or unde- 
tectable viral replication suggests that CTL responses and the 
viral load could be positively correlated. However, a number of 
reports have been published over the last few years that pro- 
vide conflicting data on the correlation between the CTL ac- 
tivity, viral load, and CD4 counts (1, 7, 13, 14, 18, 19, 22, 25, 42, 
45, 53). We therefore took advantage of the large number of 
untreated individuals in our cohort with a wide distribution in 
viral loads and CD4 counts to reassess these associations. 

While no correlation was found between the viral load and 
either the breadth or magnitude of the responses (Fig. 4A and 
C), the CD4 count was associated with the number of targeted 
peptides (P = 0.0032; r = 0.38) (Fig. 4B) and showed a trend 
toward correlation with the total magnitude of responses (P = 
0.0666) (Fig. 4D). 

The frequency of peptide recognition is influenced by the 
relative conservation of the peptide, immunoproteasomal cleav* 
age sites, and C-terminal forbidden amino acids. Besides HLA 



allele frequencies, factors such as viral sequence heterogeneity 
and processing preferences may contribute to the clustering of 
CTL responses. We took advantage of the extensive data set 
generated in this study to explore the factors that may deter- 
mine peptide recognition by analyzing sequence heterogeneity, 
the distribution of immunoproteasomal cleavage sites, and 
amino acid residues known to impair epitope binding to class 
I molecules when present at the C terminus. 

Sequence variability was addressed by creating a variability 
score based on the average Shannon entropy for each position 
in the peptide, using the B clade reference set from the HIV 
database at Los Alamos for our reference alignment. Figure 5 
demonstrates that on a genomewide level, conserved peptides 
displaying low entropy scores were more frequently targeted 
than variable peptides. 

Enrichment for increased predicted immunoproteasomal 
scores (34) in the C-terminal halves of peptides would be 
expected to increase the generation of optimal peptides with a 
variety of C termini which would bind to different HLA alleles. 
On the other hand, an increased frequency of amino acid 
residues rarely seen at the C termini of optimally defined CTL 
epitopes (forbidden amino acids — G, P, E, D, Q, N, T, S, and 
C) (9, 15, 21) would be expected to limit the number of optimal 
epitopes that could be generated from a longer peptide se- 
quence. The data presented in Fig. 6A to C show that the 
cleavage preference was directly correlated with the frequency 
of recognition of a given peptide (P = 3 x lO"'*; r = 0.18), 
while the content of forbidden amino acids and the variability 
of a peptide were inversely correlated with the frequen<7 of 
recognition (P = 1.2 x 10-^ r = -0.21, andP = 1.5 X lO'^ 
r = -0.25, respectively). The three described variables did not 
correlate with each other, although there was a trend toward 
an inverse correlation between the cleavage score and the 



TABLE 6. Highly reactive regions to be included in vaccine design"* 



OLP 


Protein 


11-12 


Gag-pl7 


22-25 


Gag-p24 


28-30 


Gag-p24 


35-37 


Gag-p24 


40-42 


Gag-p24 


59 


Gag-pl5 


281 


Vpr 


293-295 


gpl20 


75-79 


Nef 


81^5 


Nef 



Sequence 



TGSEELRSLYNTVATLYCVHQRIEVKDTKEALEK 

WVKVVEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKE 

LKETINEEAAEWDRLHPVHAGPIAPGQMREPRGSDL\GTTS 

WMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIRQGPKE 

GPKEPFRDYVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDC 

TRRQANFLGKIWPSHKGRPGNFLQSRP 

ELKNEAVRHFPRIWLHSLGQHIYETYG 

AAEQLWVTVYYGVPVWKEATTTLFCASDACAYDTEVHNVWA 

WLEAQEEEEVGFPVRPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIYSQKRQDILDLWV 
QKRQDILDLWVYHTQGYFPDWQNYTPGPGIRYPLTFGWCFKLVPVEPEKVEEAN 



Length* 



34 
49 
41 
43 
43 
27 
27 
41 
58 
25 



« Peptides were included when recognized by at least 20% of the overall population, and adjacent peptides were added when targeted by >10% of the ovcraU cohort. 
* Number of amino acids. 
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FIG. 3. Effective antiretroviral treatment reduces CTL activity. EUSpot responses are shown for 61 untreated (— T^; squares) and 89 treated 
(+Tx; diamonds) subjects. The latter were further divided into 42 non-treatment-suppressed (viral load [VL] > 200; triangles) and 44 treatment- 
suppressed (VL £ 200; inverted triangles) individuals. Shown are the breadth of responses (number of targeted peptides) (A) and the total 
magnitude (number of SFC per million PBMCs) (B) for the different groups. The thick lines represent the median values for each group, values 
were determined by Mann-Whitney tests. 



entropy (P = 0.03; r = -0.10), suggesting that greater variabil- 
ity is associated with less efficient processing. Importantly, while 
each factor is independently related to the T-cell response— the 
effect of the entropy was more pronounced than those of the 
forbidden residues and the cleavage scores — ^the presence of 
two or more of these factors is cumulatively predictive for the 
frequency of recognition (Fig. 6D), Thus, the data indicate that 
peptides displaying low entropy, a low fraction of forbidden 
amino acids, and a high cleavage score are most frequently 
recognized in these in vitro studies, and antigens with these 
characteristics may represent potent vaccine candidates. 

DISCUSSION 

Since the first description of fflV-specific CTL, numerous 
studies have supported the important role of this arm of the 
cellular immune system, and a considerable number of vaccine 
development efforts are directed toward the design of CTL- 
based vaccination strategies (10, 39, 54). Not only do global 
and local viral sequence diversities represent a major hurdle 
for vaccine development, but host genetics, especially HLA 
distribution, may also need to be taken into account when 
designing vaccine candidates to elicit strong CTL responses. In 
order to identify immunodominant regions in HIV that are 
targeted largely independently of HLA diversity, we assessed 
total HIV-specific CDS"^ -T-cell responses in a cohort of 150 
HIV clade B-infected individuals of African-American, Cau- 
casian, Hispanic, and West Indian descent, thus including the 
ethnicities in which the HIV epidemic progresses fastest in the 
United States (31). Total CD8-*'-T-cell IFN-7 ELISpot re- 
sponses were found to correlate with individuals' CD4 counts, 
but not viral loads, although CTL responses were higher in 
untreated individuals and individuals who obtained treatment 
but were unable to control viral replication. The single-pep- 
tide-based screening of this genetically diverse cohort allowed 
the identification of 10 regions that were heavily targeted in the 



overall cohort and thiis represent responses that are not re- 
stricted to a certain ethnicity. The present study also identifies 
three mechanisms which, independently from each other, pre- 
dict the likelihood of single peptides being recognized in these 
in vitro studies. These factors — peptide entropy, the presence 
of immunoproteasomal cut sites, and the absence of amino 
acids that impair binding of optimal epitopes to the restricting 
class I molecules— may thus be employed to identify immuno- 
dominant regions that could be included in vaccine design, not 
only for HIV, but also for other pathogens. 

In order to assess the extent of ethnicity-specific responses 
that may be present in one ethnic group but absent in others 
and to investigate whether these responses are due to HLA 
diversity and potential differences in the HLA class I allele- 
specific binding motifs, frequently targeted responses were 
compared among ethnicities. Only a few HLA alleles were 
found to differ significantly in their distributions among the 
ethnicities studied, most prominently HLA All and HLA A25, 
and the analysis of ethnicity-based peptide targeting revealed 
only six peptides with significantly altered frequencies of rec- 
ognition. In each case, specific HLA class I alleles could be 
implicated as likely mediating the observed effect, although 
their distributions among ethnicities did not reach statistical 
significance in most cases. It is possible that peptides may carry 
multiple overlapping epitopes that can be presented by differ- 
ent HLA alleles, and this redundancy may overshadow specific 
HLA differences in population frequencies. 

Furthermore, the cohort was also analyzed for the presence 
of HLA alleles associated with slow or fast disease progression 
(reviewed in reference 52). While ethnic groups with a high 
representation of these HLA alleles did not show significant 
differences in the viral loads of untreated individuals, our data 
confirmed the association of HLA-B57 with low plasma viral 
loads when the analysis was performed across ethnicities. 

This study, the first to determine the total HIV-specific 
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FIG. 4. CTL activity correlates with CD4 counts but not viral loads in untreated subjects. Association between the number of targeted peptides 
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subjects with CD4 counts of >1,000 cells/pJ are excluded (P = 0.0085). CD4 counts were available for 59 subjects. 



CD8'*'-T-cell response in a large and predominantly non-Cau- 
casian cohort, revealed an unprecedented breadth in the dis- 
tribution of responses. All ethnicities were shown to mount 
CTL responses against every HIV protein, and <12% of the 
peptides were not targeted by any individual. This supports the 
usefulness of a consensus sequence-based approach, in which 
the test peptides on average resemble the individual's autolo- 
gous virus more closely than peptide sets based on specific 
viral-isolate sequences (24, 55). 

The present study included individuals with and without 
current antiretroviral treatment and with a wide range of viral 
loads and CD4 counts, allowing correlations of these parame- 
ters with immune responses in a large number of subjects. 
Surprisingly, when the breadths and magnitudes of responses 
in treated and untreated individuals were compared, no differ- 
ence in CD8***-T-cell activity was observed. Only when the 
treated group was further stratified into treatment-suppressed 
and non-treatment-suppressed individuals were significant dif- 
ferences found in the breadth and magnitude of the virus- 



specific T-cell responses. These were significantly higher in 
both the untreated and the non-treatment-suppressed groups 
than in individuals controlling their viral loads to s200 RNA 
copies/ml, extending earlier reports demonstrating that CTL 
responses decline under effective treatment and may, at least 
partially, fall below the limit of detection of current assays (4, 
29, 30, 32). 

Although this finding would suggest that the antigen load is 
the driving force for the immune response, we were unable to 
establish a correlation between the plasma viral load and either 
the breadth or magnitude of HIV-specific CDS"^ -T-cell re- 
sponses, in contrast to previously published reports from other 
laboratories, which have reported positive (7, 13, 42) or nega- 
tive (14, 18, 22, 42, 45. 53) correlations between these param- 
eters. However, many of these earlier studies were performed 
on smaller cohorts and were less comprehensive than the 
present study. This does not, however, rule out the possibility 
that in the large amount of data generated by these compre- 
hensive screenings, relevant immune responses may be ob- 
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FIG. 5. The frequency of peptide recognition is inversely correlated with peptide entropy. The number of subjects recognizing each peptide and 
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FIG. 6. (A to C) Influence of entropy (A), fraction of forbidden amino acids (AA) (B), and proteasomal cleavage scores (C) on the frequency 
of recognition of OLPs. Entropy and the fraction of forbidden amino acids correlate inversely {P < 0.0001) and cleavage scores correlate positively 
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soured, and that some specificities may indeed correlate with 
viral loads. A further factor that should be considered is that 
our study was cross-sectional and the relative times from in- 
fection when the samples were obtained were not known. In 
addition, the possibility that responses to autologous virus 
were missed by using the reference strain of HIV employed 
here must be considered (2). 

While the viral load was not associated with CD8"^-T-cell 
activity, CD4 counts were found to positively correlate with the 
number of targeted peptides. Overall, there was also a trend 
toward a correlation between the magnitude of CD8'^-T-cell 
responses and the CD4 count which was statistically significant 
when only the responses against the more conserved proteins, 
Gag and Pol, were considered, partly supporting the findings of 
Edwards et al; (22), who reported a positive correlation be- 
tween Gag responses and CD4 counts. Thus, while antigen 



presence above a threshold seems to be required to maintain a 
full CD8"^-T-cell response, CD4 counts are, on a large-popu- 
lation level, a better predictor of the breadth and magnitude of 
HIV-specific CDS'^-T-cell responses. However, the mecha- 
nisms behind this association remain unclear, as broad CTL 
responses could be crucial to maintaining 004"*" T cells, or an 
elevated CD4 count could be necessary to enable the immune 
system to mount a broad CD8'*"-T-cell response. Further anal- 
ysis will be necessary to understand the relationship between 
the CD4 count and CTL responses, especially studies address- 
ing the antigen specificities of these CD4'*' T cells. 

Overall, the present data demonstrate that almost the entire 
HIV protein sequence can be targeted by 008"^ T cells. Thus, 
all HIV proteins are subject to antigen processing and presen- 
tation and could be included in HIV vaccines. However, de- 
spite this wide distribution of responses, the targeted peptides 
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seemed to cluster in highly reactive regions, especially within 
the HIV Gag and Nef proteins. This pattern was remarkably 
conserved among all four ethnic groups, allowing the identifi- 
cation of 10 reactive regions, each recognized by at least 20% 
of the overall cohort. These clusters were so consistently tar- 
geted between ethnicities that the combination of as few as 
three regions, each comprising 30 to 45 amino acids, was suf- 
ficient to achieve population coverage of >90%. As these 
regions represent potential vaccine candidates, it will be inter- 
esting to assess whether the same regions will be targeted in 
individuals of additional ethnicities or individuals infected with 
viruses other than clade B. Indeed, 6 of these 10 highly immu- 
nogenic clusters were also reported in the extensive studies of 
HIV clade C-infected individuals in Botswana performed by 
Novitsky et al. (41). These findings strongly suggest that factors 
other than host genetics may dictate which regions of the virus 
are immunogenic for HLA class I-restricted T cells. 

There are several factors that could influence the observed 
CTL epitope clustering, among them sequence heterogeneity 
(9, 26, 38, 55). Yusim et al. used the complete set of optimal 
HIV-derived CTL epitopes included in the Los Alamos HIV 
Immunology Database to show that regions of HIV-1 proteins 
that were epitope rich (i) were conserved, (ii) had a high 
propensity to be cleaved by the immunoproteasome, and (iii) 
had a lower frequency of amino acids that do not serve as 
C-terminal anchor residues compared to regions of proteins 
that had a paucity of CTL epitopes (55). However, the study 
was based on tallying how many previously defined unique 
epitopes overlapped in a region and was unable to address how 
these factors related to the frequency of recognition. In con- 
trast, in the present study, we were able to directly assess 
whether sequence entropy, the presence of proteasomal cleav- 
age sites, or the frequency of forbidden residues correlated 
with the frequency at which a specific peptide was targeted; we 
applied a systematic approach to a genetically heterogeneous 
population, and our peptides completely spanned all HIV pro- 
teins for a comprehensive comparison. The three factors vari- 
ability, proteasomal cleavage propensity, and low frequencies 
of forbidden amino acids were all found to be strong and 
independent predictors of the number of detectable CTL re- 
sponses directed at a peptide. Importantly, since sequence 
entropy was not the only factor that predicted peptide reactiv- 
ity, using autologous peptide test sequences might not com- 
pletely eliminate epitope clustering, and autologous test se- 
quences should still produce fewer reactive peptides in regions 
with selectively reduced proteasomal cleavage sites and HLA 
class I binding potential. 

The relatively reduced number of reactive peptides in vari- 
able regions could be in part a consequence of these regions 
being less functionally constrained. Specific CTL escape mu- 
tations in variable regions may have a greater capacity to cu- 
mulatively diminish the overall potential for epitope processing 
and presentation at the population level (40, 55), as the fitness 
costs of escape may be less able to drive reversion to the 
susceptible form upon transmission to a new host, where spe- 
cific immune pressure is lifted. This concept is supported by 
the finding that the predicted frequency of proteasomal cleav- 
age sites showed a trend toward an inverse correlation with 
peptide entropy, suggesting that sequence variability may re- 
flect viral adaptation and antigen-processing escape. These 
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three correlates of CTL immunogenicity combined may drive 
the observed epitope clustering, allowing the identification of 
interindividually highly immunodominant peptides with pat- 
terns that hold across different ethnicities. 

Taken together, the results of the present study, performed 
on an extensive population with diverse genetic backgrounds, 
describe highly immunogenic peptides that may be incorpo- 
rated into HIV vaccine candidates independently of host ge- 
netics. The study also provides mechanisms that are at least 
partially responsible for the in vitro immunodominance and 
likely high in vivo immunogenicity of these peptides. A com- 
bination of these regions or proteins, along with additional 
vaccine components that are able to induce strong T helper cell 
responses and potentially neutralizing antibodies, should con- 
tribute to the design of an effective HTV vaccine (48, 50). 
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